Downloads
This page hosts bulk downloads for all transcriptions factors in the UniPROBE database, organized by publication, and for the SQL tables holding PBM data or factor annotations. These files can be quite large, so please be prepared for significant download times. Please note that any files which contain PBM probe sequences are protected by an academic research use license, which will require confirmation prior to file download. Additionally, the various papers and datasets covered in this database have employed a variety of universal array designs created using various deBruijn sequences. Users interested in the 60-mer probe sequences should download the probe sequences associated with the particular datasets or papers of interest.
If you wish to download individual files for certain transcriptions factors, you can find them most easily using the browse page, which provides an interface for text-based searches. The site's public directory index also makes all database files easily accessible.
All Data
Every download in this section contains a file for each protein in the database.
- Ungapped 8-mer Data
-
The enrichment scores of the contiguous 8-mers as calculated for each protein.
- PWMs
-
The position weight matrices (PWMs) for each experiment. These matrices hold frequency values.
- Normalized Probe Data (60-mer signal intensities and sequences)
-
The median signal intensity values and corresponding nucleotide probe sequences.
- Raw Probe Data.
-
Text files containing the unprocessed output from the PBM array runs.
By Publication:
Grove, De Masi et al., Cell 2009
- All Data
-
This download includes all experimental data and computational analyses from Grove, De Masi, et al's compilation of the C. elegans bHLH and dimers PBM results that are available on UniPROBE.
- All Ungapped 8-mer Data
-
The enrichment scores associated with each contiguous 8-mer for each bHLH factor and dimer. The columns are as follows:
3 columns
1: 8-mer sequence
2: Reverse complement of 8-mer sequence
3: Enrichment score
Separate files for each bHLH factor and dimer: Download
- Top Gapped 8-mer Data
-
Enrichment scores, median signal intensities, and z-scores for the gapped 8-mers with enrichment scores above a 0.25 threshold.
- PWMs
-
The position weight matrices (PWMs) for each experiment. These matrices hold frequency values.
Separate files for each factor. Files ending in _RC.pwm contain the reverse complement of the forward PWM: Download
Single text file containing all PWMs: Download
- Motif Logos
-
The motif logos derived from the corresponding position weight matrices. Files ending in _RC.png show the logo for the reverse complement of the forward motif.
- 60-mer Probe Sequences
-
Holds separate text files for each bHLH factor and dimer, where a given file contains the normalized signal intensities and nucleotide sequences of PBM probes (designed from our 'all 10-mer' de Bruijn sequences)
Badis, Berger, Philippakis, Talukder, Gehrke, Jaeger, Chan, et al., Science 2009
- All Data
-
This download includes all experimental data and computational analyses from the most recent compilation of PBM results. These files include both raw and processed files from PBM array version 1 (v1) arrays, raw and processed files from array version 2 (v2) arrays, and the results of analyses performed to integrate the v1 and v2 array data. (Warning: this file is very large (~1.7GB) and may take several hours to download).
- All Ungapped 8-mer Data
-
Median signal intensities, enrichment scores, z-scores, p-values for z-scores, Q-values for the enrichment scores, and Q-values for the z-scores associated with each contiguous 8-mer. This file holds separate text files for each TF, where a given file has rows for each contiguous 8-mer. The columns are as follows:
20 columns:
1: 8-mer sequence
2: complement of 8-mer sequence
3: Median Signal Intensity Version 1
4: Enrichment Score Version 1
5: Z-Score (MAD estimation of sd) Version 1
6: Median Signal Intensity Version 2
7: Enrichment Score Version 2
8: Z-Score (MAD estimation of sd) Version 2
9: P-value for Z-Score Version 1
10: P-value for Z-Score Version 2
11: P-value for combined Z-Score from Array Versions 1 and 2
12: P-value for Enrichment Score Version 1
13: P-value for Enrichment Score Version 2
14: P-value for average Enrichment Score from Array Versions 1 and 2
15: FDR Q-value for Z-Score Version 1
16: FDR Q-value for Z-Score Version 2
17: FDR Q-value for combined Z-scores (Z-Score Version 1, Z-Score Version 2)
18: FDR Q-value for Enrichment Score Version 1
19: FDR Q-value for Enrichment Score Version 2
20: FDR Q-value for averaged Enrichment Score from Array Versions 1 and 2: Download
Table of all median intensity values, array v1: Download
Table of all median intensity values, array v2: Download
Table of all enrichment scores, array v1: Download
Table of all enrichment scores, array v2: Download
Table of all enrichment scores, combined: Download
Table of all normalized z-scores, array v1: Download
Table of all normalized z-scores, array v2: Download
- Top Gapped 8-mer Data
-
Enrichment scores, median signal intensities, and z-scores for the gapped 8-mers with enrichment scores above a specified threshold.
- PWMs
-
The position weight matrices (PWMs) for primary and secondary motifs. These files are in log-likelihood format, such that the value in each cell represents the frequency that a given nucleotide is predicted be present at the specified position.
Separate files for each factor: Download
Single text file containing all PWMs: Download
- Motif Logos
-
The motif logos derived from the corresponding position weight matrices.
- Raw Probe Data.
-
Text files containing the unprocessed output from the PBM array runs.
- 60-mer Probe Sequences
-
Holds separate text files for each TF, where a given file contains the normalized signal intensities and nucleotide sequences of PBM probes (designed from our 'all 10-mer' de Bruijn sequences)
Lesch et al., Genes & Dev. 2009
- All Data
-
This download includes all experimental data and computational analyses from this publication that are included in UniPROBE.
- All 8-mer Data
-
The median signal intensities and enrichment scores associated with each contiguous 8-mer.
Single text file for all median intensities: Download
- PWMs
-
The position weight matrices (PWMs) for each experiment. These matrices hold frequency values.
Single text file containing all PWMs: Download
- Motif Logos
-
The motif logos derived from the corresponding position weight matrices.
- 60-mer Probe Sequences
-
Contains the normalized signal intensities and nucleotide sequences of PBM probes (designed from our 'all 10-mer' de Bruijn sequences).
Zhu, Byers, McCord, et al., Genome Research 2009
- All Data
-
This download is a compilation of all the yeast PBM experimental data and computational analyses available on UniPROBE. Please note that this file is very large (>.4GB) and may take several hours to download.
- All Ungapped 8-mer Data
-
Median signal intensities and enrichment scores associated with each contiguous 8-mer for each transcription factor. There are separate files for each transcription factor. The columns are as follows:
4 columns
1: 8-mer sequence
2: Reverse complement of 8-mer sequence
3: Median signal intensity
4: Enrichment score
- Top Gapped 8-mer Data
-
Enrichment scores, median signal intensities, z-scores for the gapped 8-mers with enrichment scores above a specified threshold.
- PWMs
-
The position weight matrices (PWMs) for each experiment. These matrices hold frequency values.
Separate files for each factor: Download
Single text file containing all PWMs: Download
- Motif Logos
-
The motif logos derived from the corresponding position weight matrices.
- 60-mer Probe Sequences
-
Holds separate text files for each TF, where a given file contains the normalized signal intensities and nucleotide sequences of PBM probes (designed from our 'all 10-mer' de Bruijn sequences).
Scharer et al., Cancer Res. 2009
- All Data
-
This download includes all experimental data and computational analyses from this publication that is available on UniPROBE.
- All 8-mer Data
-
The median signal intensities and enrichment scores associated with each contiguous 8-mer.
Single text file for all median intensity scores: Download
- PWMs
-
The position weight matrices (PWMs) for each experiment. These matrices hold frequency values.
Single text file containing all PWMs: Download
- Motif Logos
-
The motif logos derived from the corresponding position weight matrices.
- 60-mer Probe Sequences
-
Contains the normalized signal intensities and nucleotide sequences of PBM probes (designed from our 'all 10-mer' de Bruijn sequences).
Pompeani et al., Mol Microbiol 2008
- 60-mer Probe Sequences
-
The normalized signal intensities and nucleotide sequences of PBM probes (designed fom our 'all 10-mer' de Bruijn sequences) for LuxR. There is no other factor data available from this publication.
Berger, Badis, Gehrke, Talukder, et al., Cell 2008
- All Data
-
This download includes all experimental data and computational analyses from the most recent compilation of the PBM homeodomain results that are available in UniPROBE. (Warning: this file is very large (~1.6GB)).
- All Ungapped 8-mer Data
-
The enrichment scores, median signal intensities, z-scores, p-values for the enrichment, and Q-values for the enrichment associated with each contiguous 8-mer.
Separate files for each factor, where each file contains a column for each of the above statistics: Download
Single text file for all enrichment scores: Download
Single text file for all normalized intensity scores: Download
Single text file for all calculated z-scores: Download
Single text file for all p-values for enrichment: Download
Single text file for all Q-values for enrichment: Download
Single text file for all Q-values for the z-scores: Download
- Top Gapped 8-mer Data
-
Enrichment scores, median signal intensities, and z-scores for the gapped 8-mers with enrichment scores above a specified threshold.
- PWMs
-
The position weight matrices (PWMs) for each experiment. These matrices hold frequency values.
Separate files for each factor: Download
Single text file containing all PWMs: Download
- Motif Logos
-
The motif logos derived from the corresponding position weight matrices.
- Raw Probe Data.
-
Text files containing the unprocessed output from the PBM array runs.
- 60-mer Probe Sequences
-
Holds separate text files for each TF, where a given file contains the normalized signal intensities and nucleotide sequences of PBM probes (designed from our 'all 10-mer' de Bruijn sequences).
De Silva et al., PNAS 2008
- All Data
-
This download includes all experimental data and computational analyses from this publication available in UniPROBE.
- All Ungapped 8-mer Data
-
The median signal intensities and enrichment scores associated with each contiguous 8-mer.
Separate files for each factor: Download
Single text file for all enrichment scores: Download
Single text file for all median intensity scores: Download
- PWMs
-
The position weight matrices (PWMs) for each experiment. These matrices hold frequency values.
Separate files for each factor: Download
Single text file containing all PWMs: Download
- Motif Logos
-
The motif logos derived from the corresponding position weight matrices.
- Raw Probe Data.
-
Text files containing the unprocessed output from the PBM array runs.
Berger, Philippakis, et al., Nat. Biotech. 2006
- All Data
-
This download includes all experimental data and computational analyses from this publication that is available on UniPROBE.
- All Ungapped 8-mer Data
-
The enrichment scores associated with each contiguous 8-mer.
Separate files for each factor: Download
Single text file for all enrichment scores: Download
- Top Gapped 8-mer Data
-
The enrichment scores for the gapped 8-mers (up to 11 bp length) with enrichment scores above 0.25.
- PWMs
-
The position weight matrices (PWMs) for each experiment. These matrices hold frequency values.
Separate files for each factor: Download
Single text file containing all PWMs: Download
- Motif Logos
-
The motif logos derived from the corresponding position weight matrices.
- Raw Probe Data.
-
Text files containing the unprocessed output from the PBM array runs.
Separate files for each factor: Download
Excel spreadsheet containing raw data for all factors: Download
- 60-mer Probe Sequences
-
Microarry probes ranked by their normalized signal intensities (designed from our 'all 10-mer' de Bruijn sequence).
Separate files for each factor: Download
Excel file containing data for all factors: Download